Improving Chinese SRL with Heterogeneous Annotations
نویسندگان
چکیده
Previous studies on Chinese semantic role labeling (SRL) have concentrated on single semantically annotated corpus. But the training data of single corpus is often limited. Meanwhile, there usually exists other semantically annotated corpora for Chinese SRL scattered across different annotation frameworks. Data sparsity remains a bottleneck. This situation calls for larger training datasets, or effective approaches which can take advantage of highly heterogeneous data. In these papers, we focus mainly on the latter, that is, to improve Chinese SRL by using heterogeneous corpora together. We propose a novel progressive learning model which augments the Progressive Neural Network with Gated Recurrent Adapters. The model can accommodate heterogeneous inputs and effectively transfer knowledge between them. We also release a new corpus, Chinese SemBank, for Chinese SRL. Experiments on CPB 1.0 show that ours model outperforms state-of-the-art methods.
منابع مشابه
A Progressive Learning Approach to Chinese SRL Using Heterogeneous Data
Previous studies on Chinese semantic role labeling (SRL) have concentrated on a single semantically annotated corpus. But the training data of single corpus is often limited. Whereas the other existing semantically annotated corpora for Chinese SRL are scattered across different annotation frameworks. But still, Data sparsity remains a bottleneck. This situation calls for larger training datase...
متن کاملImproving Nominal SRL in Chinese Language with Verbal SRL Information and Automatic Predicate Recognition
This paper explores Chinese semantic role labeling (SRL) for nominal predicates. Besides those widely used features in verbal SRL, various nominal SRL-specific features are first included. Then, we improve the performance of nominal SRL by integrating useful features derived from a state-of-the-art verbal SRL system. Finally, we address the issue of automatic predicate recognition, which is ess...
متن کاملImproving Chinese Semantic Role Labeling with Rich Syntactic Features
Developing features has been shown crucial to advancing the state-of-the-art in Semantic Role Labeling (SRL). To improve Chinese SRL, we propose a set of additional features, some of which are designed to better capture structural information. Our system achieves 93.49 Fmeasure, a significant improvement over the best reported performance 92.0. We are further concerned with the effect of parsin...
متن کاملImproving Chinese Semantic Role Labeling with English Proposition Bank
Most researches to SRL focus on English. It is still a challenge to improve the SRL performance of other language. In this paper, we introduce a twopass approach to do Chinese SRL with a Recurrent Neural Network (RNN) model. We use English Proposition Bank (EPB) to improve the performance of Chinese SRL. Experimental result shows a significant improvement over the stateof-the-art methods on Chi...
متن کاملLearning Chinese language structures with multiple views
Motivated by the inadequacy of single view approaches in many areas in NLP, we study multi-view Chinese language processing, including word segmentation, part-of-speech (POS) tagging, syntactic parsing and semantic role labeling (SRL), in this thesis. We consider three situations of multiple views in statistical NLP: (1) Heterogeneous computational models have been designed for a given problem;...
متن کامل